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(57) Abstract , 

Methods of designing .or^ .modifying pro- 
tein structure at the protein or genetic level to 
produce specified amino termini in vivo or in vit- 
ro. The rrtethods can be used to alter the meta- 
bolic stability and other properties of the protein 
or, alternatively, to artificially generate authentic 
amino-termini in proteins produced through ar- 
tificial means. The methods are based upon the 
introduction of the use of artificial ubiquitin- 
profein fusions, and the discovery that the in vivo 
half-life of a protein is a function of the amino- 
terminal amino acid of the protein.' 
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METHODS OF REGULATING METABOLIC 
STABILITY OF PROTEINS 



Background of the Invention ' 

In both bacterial and eukaryotic cells , rela- 
tively long-lived proteins, whose half -lives are 
close to or exceed the cell generation time, coexist 
with proteins whose half-lives can be less than one 
percent of the cell generation time. Rates of 
intracellular protein degradation are a function of 
the cell's physiological state, and appear to be 
controlled differentially for individual proteins* 
In particular, damaged and otherwise abnormal 
proteins are metabolically unstable in vivo. 
Although the specific functions of selective protein 
degradation are in most cases still unknown; it is 
clear that many regulatory proteins are extremely 
short-lived in vivo . Metabolic instability of such 
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proteins allows for. rapid adjustment of their 
intracellular . concentrations, through regulated. . 
changes in rates of .their, synthesis or- degradation. 
The few instances in which. the metabolic instability 
of an intracellular protein has been shown.to.be 
essential for its function- include the. cXI protein 
of bacteriophage lambda . and the HO endonuclease- of 
the yeast Saccharomyces cerevisiae. ; 

Most of the selective turnover of intracellular 
proteins under normal metabolic. conditions is 
ATP-dependeht = and ( in eukaryotes ) nonlysospmal . 
Recent - biochemical • and genetic evidence indicates 
that, in eukaryotes/ - covalent conjugation of ubi- 
-quitin- to short-lived intracellular- proteins is 

essential for their selective degradation. The 
rules which determine whether a given protein is 
metabolically stable or unstable: in viyo were . . 
previously unknown. ' - . : . : 

Summary of the invention 

' This invention pertains to methods of- 
engineering the amino terminus of proteins thereby 
-controlling the metabolic . stability, and other f ; : 
properties of a protein.. Further, this invention, 
- provides a method for either in vivo or in vitro 
production of proteins with any of the twenty amino 
acid residues (or: analogs ^thereof ) at the protein's 
amino-terminus . The ..invention is based in-, part upon 
the" striking discovery that, the in vivo half-life of 
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an intracellular ^prbtein^is^ar function. of its 
amino-terminal aMno • acid~residue and upon a.^novel 
(and more generally applicable;),.; technique : that . ^ 
allows J one to generate proteins with; specified 
amirtb-termini^inr vivo ore in . vitro - • The. invention 
also pertains to' a neWly. lidenfcif ied.^protease,. . • • 
ubiquitin-specif, ic process ing^protease, - which has. 
properties: that -allow one to expose, either-.in vitro 
or in' vivo , any desired; aminol acid . residue at the 
amino-termirtus of a: protein. of interest. 

*■ The nature of the:- amino^acid,,. exposed : ; at,, the 
amino-terminus ofvan intracellular protein was -shown 
to be one crucial -determinants that .specifies whether 
a protein * will be longer or shartrliyed;- in vivo. ;: .- f 
Individual- -amino, acids, can 'be .categorized as either 
stabilizing*; or destabilizing:- amino acids with 
respect to the half-life that they^cpnf er upon a 
protein when exposed at the protein's amino- 
terminus. Destabilizing amino. i acrid ^siduesv confer 
short half -lives.,/ down, to. a.. feviiminutes for some of 
the destabilizing; amino acids-. stabilizing? amino 
acid residues confer long" half olives of . . many ■ hours . 
This striking and newly discovered dependency of a 
protein f s half-life; on. itsi amino-terminal residue -is 
referred to herein as the N-'end: rule.; ; r v . . 

; Based upon the N-endrule, the. amino-terminus 
of a protein- can thus be. designed or ; altered to 
change the intracellular vhalf-life of the protein 
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and in this way the lifetime and/or activity of the 
protein irc vivo , can be . regulated.. This * capability- 
can be exploited for. rational -protein design in many 
different contexts.. -..Natural* or -wild type proteins 
can ,be, modified, to render/them more* or less- ' . . 
resistant to degradation in vivo:* - The design or 
alteration of the protein /can be done at the protein 
level or at the .genetic : (DNA) level.- - For example, . 
proteins , can/ be. ;modified-by ' chemically- altering or 
engineering the amirio-terminus . to provide for 
exposure at the; amino-terminus 6f "an amino acid 
residue of the stabilizing or destabilizing class." 
At the genetic level, .genes "encoding proteins can be 
made .to , encode an ~amino. acid ..of - the desired" class at 
the amino-terminus so /that the- expressed protein 
exhibits: a predetermined amino-terminal structure « 
which renders It either metabo-Iicaily stable or 
unstable with respect to the N-end rule pathway of 
proteolytic* degradation. Furthermore, proteins dan 
be expressed fused to ■ a- -"masking" protein -sequence 
which masks the engineered amino-terminus v so that 
when unmasked; the protein will exhibit the desired 
stability or other properties that depend on the 
nature of the protein 1 s amino -terminal residue. In 
such constructs, for example, the junction between ' 
the two protein sequences can be- designed, to be 
cleaved -specifically, .for instance, by an endopro- 
tease. Endoproteolytic -cleavage of ; the fused 
sequence unmasks the . 
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specifically; engineered \aminot-terminus of the* - 
. protein of interest; and subjects, the protein to 
degradation, governed a by! the. N^end" rule. One- spe- 
cific and . new way . to engineer , rthe . protein 1 s amino- 1 
terminus is* provided in this iriviention by the 
identification; of . ubiquitin-specific- processing 
protease and .determination of. its substrate speci- 
ficity*. Using this ^protease* fusions of ubiquitin . 
with other proteins can be specifically processed * 
either in vitro or in vivo to generate proteins with 
desired aminq-terminal .residuesi 

A, different i and _als.o t: new way-.to ..specifically 
engineer . short-lived ; proteins is- -provided in "this .' r- 
invention, by -the. discovery that ubiquitin-protein J 
fusions, such as ,ubiquitin-Pro- ^-galactosida'se, ■" . ^ 
that cannot be efficiently deubiquitinated, are . is 
metabolically unstable. . ;Thus, by attaching the r* 
amino-terminal ubiquitin moiety 1 to a protein in a: ^ 
way that makes its removal either impossible, or 
inefficient, one can destabilize proteins by a 
distinct technique that*, is not directly based on the 
N-end rule, t . ^ „_ . : „ /: 

In . additioa, variant qells can ; be developed = 
which contain putative mutations ; in the "Nrend" 
degrading: protease (s) .which -either conditionally or 
nonconditionally stop degrading shortr lived .pro- 
teins. ; These cells can be used to -overproduce - 
proteins that ordinarily would be short-lived within 
the cell. 
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lacZ gene fusions.. 

" Figure 2 shows' "ekperim'ents'' in which the' half - 
lives of engineered" ^'-gal^proteihs- are directly 

measured. " \ 

Figure 3 shows "the changing of amino acid' 
residues at the ubiqiiitih- $ 4 a l junction' (A) using 
the newly discovered jprbjperties of ubiquitin-" 
specific processing protease and the amino acid 
sequence in the vicinity' of the junction- (B) . 

Figure 4 showi l th§ ? presence of multiple ubi- 
quitin moieties in Saeta^ol icaliy unstable : f -gal 

proteins. ' " , ■' . '* 

Figure 5. shows a series of p -gal species 
containing ubiquitiri'in metabdlically unstable • 

" p-gal proteins. 

Figure 6 shows that both prokaryotic and 
eukaryotic loncf-lived ' intracellular ' proteins have ' 
stabilizing amino acid residues at their amino 
termini whereas secreted proteins exhibit a com- 
plementary bias. " .... 

Detailed Description of the Invention 

' ! The elucidation of the' It-end rule is described 
"in detail below. " Briefly, this rule governing 
protein degradation was revealed by examining the in 
vivo' half-lives of the enzyme ^-galactosidase having 
various amino acid residues at its amind-terminus 
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and produced, as a fusion ^prpteiVi with ubiquitin. 
When a chimeric gene encoding a ubiquitin-^ -galac- 
tosidase.. fusion protein is expressed in the yeast S. 
cerevis.iae , „, ubiquitin, is, gleayed off the nascent 
fusion protein, yielding a deubiquitinated-y^-galac- 
tosidase ( ^gal) . , With, one .exception, this clea- 
vage takes place regardless pf the riature of the 
amino acid residue of /?gal at t the ubiquitin-^3 gal 
junction, thereby makiiig it possible to expose 
selectively different .residues at the aminq-termini 

of otherwise identical ^gaj. . proteins • The f2 gal 

proteins so designed exhibited, strikingly different 
half-lives in vivo , ranging from more than 2 0 hours 
to less than . 3 minutes, depending upon the nature of 
the amino acid ; at the amino-terminus . of ^ gal. 
Amino acids can be thus ordered according to the the 
half lives they confer on y^gal^when present at its 
amino-terminus. . .For example,. the aminp acids 
methionine, serine, alanine, threonin^, valine, ^ , 
glycine and cysteine confer a half-life of more than 
20 hours. Phenylalanine, leucine, asparagine, t and 
lysine yield half lives of about three minutes, 
Arginine, the most destabilizing .a.mino acid, : confers 
a half life of about two minutest 

for complete list of amino acid$ and the correspond- 
ing half-lives) . ; 4 . ... . . , ?; 

Currently known amino-terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins from both prokaryotes and eukaryotes belong 
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virtually exclusively to the stabilizing class of 
amino acids, exactly as predicted ;by : .the N-end rule. 
This result . stongly implicates the N^end r rule' in the 
selective degradation of intracellular proteins in 
general.-.- : . r ,-: ■ ■:■ ; . ; ^ 1 . 

t The v appropriate amino-terminal amino acid: 
appears. to be. an - essential (though^ not necessarily a 
sufficient) requirement for : the metabolic stability 
of a ^oncompartm^ntalized, -intracellular protein. 
Thus, in order for a ; protein to be relatively stable 
intracellularly, a stabilizing amino - acid .should be 
present at the, amino-terTriinus * The presence of a 
destabilizing residue :C at jthe amino- terminus of a 
protein is - oft.en r . though not always / : sufficient for 
its metabolic, destabilize in : vivo- . - .When such r 
des;tabilization occurs to a relatively small extent, 
further analysis shows -either, an insufficient:; 
accessibility of : -the : amino-terminus or: a lack of ; . 
"permissible-" sequence environment in the. vicinity 
of the a^ino^terminus such as a lack of : segmental 
mobility ^ in - the protein's amino-terminal region. 
The presence of a- stabilizing- amino acid at . the 
amino-terminus at least in some cases (for instance., 
as observed, for ^-gal) .will confer stability upon 
the protein.;, -However, arstabilizing amino acid at 
the amino-terminus may... not -always -confer a long ! 
half-rlife because < other tdegradative- pathways may be 
involved, in- determining the. ultimate ^fate of pro-- 
tein. For example,, iendqproteolytic . cleavages ' 
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(cleavages outside r o£ -terminal 'regibhs of the ; 
. . protein). - may result ' in 1 expostfre of a destabilizing 
amino acid": at the amino^terirtxnus of a resulting ' 
product of the cleavage which is then rapidly 
degraded -via the- f N—eriH rulb pathway • Appropriate 
circumstances ; for use of a stabilizing' amino acid ' 
can be ascertained empirically. l>r - : 

Although *the J N^end rule ihay be' only one com- 
ponent V (albeit a central one)' of a more complex 
"half ^life rule" which embraces "other aspects of 1 
selective protein degradation in vivo ; the N-ehd - 
rule provides a rational/' practicable approach for 
designing ; dr changing protein structure'' in- order to 
produce proteins Which are* more or less 1 resistant tb 
degradation 'by the N-^end 'rule ^pathway than natural; 
unmodified protein, • Proteins can be' designed" or 
modified .at the protein or gene -leVei' to provide"" a 
desired amino acid "of either the stabilizing or 
destabilizing class at their amino^terminus . The : 
ability-- to regulate the half-li^f e ; ! of a protein will 
allow one to, modulate the; intracellular activity of 
the , protein ■ <- ". f y/^./^ xr \ : 

.A strai^htf orward; appirtsach" to modifyihg a ' 
protein to increase or decreases-its metabolic 
stability is ..to directly engineer the amirio-tdrmirius 
of the protein at the; protein level • ?! To provide a 
desired amino-terminal amino acid, the amino- v 
terminus of ■ the protein of -interest can be chemical- 
ly altered, for example, by adding an amino acid of 



BNSDOCID: <WO 88O2406A2J_> 



-10- 



the stabilizing or destabilizing. class to the 
amino-termihus* of a protein or polypeptide, em- 
ploying an appropriate chemistry. Thus, for. ex- 
ample, an unstable protein can be rendered more 
stable by adding a stabilizing amino acid residue 
(e.g. methionine serine^ alanine, threonine, valine 
glycine or cysteine) to the amino-terminus of the 
protein. Conversely, a stable protein can be 
destabilized by adding a destabilizing amino acid to 
the amino-terminus. One distinct way to modify the 
amirio-terminus of a protein would be to .employ spe-. 
df£c enzymes, amino acid-protein lig^ses, which 
catalyze posttranslatiohal addition of a single 
amino acid to the protein amino-terminus. Other 
methods for nongenetic alterations of the same type 
can reaciily be ascertained by those skilled in the 

art. " "* * / . . . . . • t , ] . 

In some proteins, the amino-terminal, end is 
obscured as a result of the protein's conformation^ 
(i.e. , its tertiary or quaternary structure).. In 
these cases, more extensive alteration .of the 
amino-terminus may be necessary to make the protein 
subject to /the N-end rule pathway. For example, . .. ., 
where simple addition or replacement of the single 
amino-terminal residue ? is insufficient because. of an 
inaccessible amino-terminus, several amino acids 
(including lysine, the site of ubiquitin joining to. 
substrate proteins) may b.e added .to the initial ^ - 



BNSDOC1D: <WO 8802406A2J_>. 



WO 88/02406 PCT/US87/02522 



-11- 

amino-terminus to increase the accessibility and/ or 
segmental mobility of the engineered amino-terminus - 

.Modification or design of the '.amino-terminus of 
a protein can also be accomplished at the genetic 
level: Conventional techniques of site directed 
mutagenesis for addition or substitution of ap- 
propriate codons to the 5 1 end of an isolated or 
synthesized gene can be employed to provide a . 
desired amino-terminal structure for .the encoded 
protein • For example, so that the .protein expressed 
has the desired amino acid at its .amino-terminus .the 
appropriate codon for a stabilizing amino acid can _ 
be inserted or built into the amino-terminus of the 
protein-encoding sequence. :> 

At "the same time, expressed proteins are often 
naturally "modified within a cell after translation - 
This can include modification of the amino-terminus. 
For example, the aminb-terminus can be acted on by , 
an aminopeptidase Which cleaves one or several amino 
acids from the amino-terminus. Amino acids may also 
be added to the amino-terminus by post-translational 
processing. This invention provides a way to 
"by-pass" still undefined rules 6i amino-terminal 
protein' processing to expose exactly and specifical- 
ly the desired amino acid residues at the amino- 
terminus of a mature processed protein species. To 
minimize the impact of such posttransiatiorial events 
on the ultimate structure of the amino-terminus of a 
protein of interest, specific fusion proteins can be 
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desxgned wherein the amino-terminus of a protein of 
interest -(-designed to^ haVe the desired stabilizing' 
or destabilizing structure) is preceded by a : :: 
"masking" ' protein Sequence-fused to the^mino : - : 
terminus. - The fusion " proteins are^ designed so that 
the protein sequent*, fused to the amino- terminus of 
the masking protein of interest is susceptible to 
specxfic cleavage at the junction between the two. 
Removal of the protein^eguence thus^ unmasks the '■ 
ammo-terminus of the protein ' of interest and the 
half-life of the released protein is thus" governed 
by the predesigned amino-terminus . The fusion ' 
protein .can be designed' for specif' ic Cleavage ' in 
Zivo, for Sample, by a : *bst "ceil endoprotease ^or ' 
for specific cleavage in a in vitr6^ys t e m where it 
can.be cleaved after Eolation from a producer cell 
(whxch lacks the' capability to cleave the fusion 
protein)'. 1 *- . "v ■ ■ » .- • : - im . 

ubiquitin is a broadly useful fusion partner ' 
f C ° nSt ^ CtiM °* * *™* Protein wi th a protein 
of interest: -the discover, that artifical ubiquitin- 
prote ln fuslbns can be cleaved precisely by 
cytoplasmic eukaryotic protease with" little or no 
dependence on the protein to which ubiquitin is ' 
fused ean be appliedboth ^ vivo and in vitro' in 
Protem engineering' strategies ,' and is a So7 
aspect of -this: invehtibn.-' *or example, the ubiquitin- 
protein fusion Method can be used to artificallf 
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generate authentic ,amin.o-tepiini in proteins pro- 
duced through ; artifiqal means-, t ., a?hus, : amino-termini 
characteristic of. na.tur^l. : euJcaryotic , or prokaryotic 
proteins can be generated, by in vitro cleavage of 
ubiguitin-protein fusions, prpduced , in a prokaryotic 

host. . ^ , , 

A specific methodology for producing. ubiquitin- 
p -galactosidase.. fusion proteins .,.is described in. 
detail, below. Genes, encoding any other proteins can 
be substituted for LacZ (the ^ -gal gene) . in . this 
methodology. . . , f . 

.In general, ubiquitin , fusion proteins are 
expressed by a chimeric gene construct comprising, r 
in 5 * to 3' orientation^ a ubiquitin -gene linked to 
a gene encoding the protein of interest.. The . codon 
for the amino-terminal amino acid ^ of ..the protein of 
interest is located immediately adjacent the 3 • end. 
of the ubiquitin gene. The fused gene, product, is 
cleaved endoproteolytically either, in vivo or in , 
vitro (using either pure or partially purjjr.ied 
ubiquitin-specif ic protease, identified, in f the 
present invention) at the. junction between; ubiquit in 
and the protein of interest to .generate the protein 
of interest having the de^i^ed amino, acid at its 
amino-terminus. . There " aire a , number of specific. uses 
for the described ability" to specifically engineer 
the protein's amino-terminus . One„ such. use,, is 
established by the fact that the intracellular 
half-life of the released protein is governed by the 
principles of the N-end rule. Other applications of 
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une speciric metnoa for engineering the protein 
amino-terminus" described herein 'range 'from adjusting 
the desired ' functional properties of a protein of 
interest/ to modulating itis antigenicity, and again, 
to other uses that can readily be ascertained by 
those skilled in the art. ' 

' - This method of generating the desired amino 
acid residue at the amino-terminus of a protein of 
interest involves" two hovel components: one,' the 
use of ubiquitih-protein fusions, and the other, the 
use of ubi^itih-specific processing protease that 
has been identified/' "anaV whose striking' substrate 
requirements were discovered, in this work. Al- 
though the initial identification' of the ubiquitin- 
specif ic protease has been made in vivo' , ' the enzyme 
is also relatively' stable" and active in vitro (in 
extracts), and "can readily be' purified to homogeneity 
by techniques "known to those skilled in the art . 
Furthermore, the substrate specificity of the ' 
ubiquitin-specific "processing protease is highly 
conserved in evolution; being the same in yeast and 
mammals.- The enzyme can be purified chromato- 
graphically from a' crude : extract by sequential 
chromatography" on phbsphoceilulose, DEAE cellulose, 
and SHSepharose among other methods known to those 
skilled in the art. Alternatively, the gene- for 
this Protease can be cloned by those skilled in the art. 
The cloned protease gene can be used either in vivo, 
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or, alternatively, the gene ..can be overe^xpressed in 
a suitable host, the oyerexpressed, ubiquitin- 
specific protease purified, and used ; far the same or 
similar purposes in vitro * .The discovery of this 
enzyme, and detailed characterization of its sub- , . 
strate specificity herein provides^ for . the in vitro 
and in vivo use of this enzyme. 

The use of ubiquitin-protein , fusions to, allow . 
the generation of a desired amino acid residue at 
the amino-terminus. of a protein of interest can be 
extended to facilitate the purification of such, . 
proteins from producer cells. A gene can 4 be readily 
constructed that encodes a convenient .marker pro- 
tein, such as streptavidin, linked to a ubiquitin- 
protein fusion construct described above. .The. 
resulting" (marker protein) -ubiquitin-protein , fusion 
can be simply isolated from producer cells by r using , 
the preselected property of the marker, protein,, for 
instance, the known ability of streptavidin tq ie ? 
isolatable by affinity chromatography on a biotin 
column. Thus, purified (marker protein) -ubiquitin- 
protein fusion can then be specifically cleaved by^ 
the ubiquitin-specif ic protease described in this, , 
invention to generate the final product, a protein . . 
of interest with the desired amino acid residue at : 
its amino-terminus. r 

The codon for the amino-terminal amino acid of 
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mutagenesis teconi-gues .currency standard in tne 
field. , If the gene encoding' ithe. protein- .of interest 
is a. synthetic gene -the appropriate 5 ' 'codbn- can be 
built-in during; the syntheticvprocess." Alter- • 
natively, • nucleotides , for a specific codon can be - 
added to the 5 ' end of an isolated or synthesized 
gene- by ligation- of . an appropriate DNA' sequence to 
the- 5.'. (amino-terminus: encoding) end- of the' gene . 

Ubiquitin-like, fusion; partners capable of being 
cleaved by the- ubiquitinfspecific' protease can also 
be usedv . In addition, v fusion- partners other then 
iibiquitin. for masking the"'amino-terminus of a- 
protein of interest .cari 'fae usedl >'In ;! appropriate 
cases> the fusion >proteins can be designed^ to 
contain a proteolytic. "cleavage site for a restric- 
tion eridoprqtease which, has sufficiently -narrow- 
specificity 'so. that .only -one target site is cleaved 
in a fusion" protein; ' A crucial property of such a 
protease must be" a. sufficiently relaxed requirement ' 
for the nature' of the i amino acid 'residue fs) abutting 
the carboxy-terminal 1 . side of - the cleavage site. • The 
target site for cleavage is' the junction between the 
fusion partner and the amino-terminus of the protein 
of interest and thus, the' -recognition site for the 
endoproteasev is located to provide for cleavage at 
this location ." ■ The commericaily available' protease; 
complement factor X exhibits these properties and 
thus can be used to directly- generate proteins with 
predetermined -amino' acid^- residues in the- ultimate 



BNSDOCID: <WO 8802406A2_I_> 



WO 88/02406 PCT/US87/02522 



-17- 

position of their -aminbrterinirii-' (see, ■ K. Nogai Vahd >r 
H . C . iThogers en Nature^ Q9:810- fl 9 8 4 ) ) . ■ The recog- 
nition site for; the en^oprptease can be engineered 
into the: -junction-, between-: -the* tasking protein 
sequence and the, 3:'i region encoding the amino- 
terminus of the protein:, of interest. - 

A different' and* distinct -method . for engineering 
short-lived proteins is' provided, in this invention 1 
by the; discovery that; ubiquitin-protein fusions, 
such as. ubiquitin-Pro-y?-galactosidase ^fusion (Table 
1) , that cannot be; efficiently deubiquitinated are > 
metabolically unstable. Thus; by. attaching the'. . : 
amino-terminal: ,ubiquitin- moiety ,to . a protein in -a 
way that makes its removal either ^impossible or . 
inefficient, one can. destabilize .. a protein by a 
distinct technique which is qualitatively different^ 
from the method, of generating the -.desired • .■ . 
amino-terminus of a protein according to the . 
requirements of .the N-end rule. Prevention of the.,, 
efficient deubiquitination of a . ubiquit.in-protein 
fusion can, be achieved in several ways, for in- 
stance, by using a proline residue at the . ^ . 
ubiquitin r protein ; junctionals shown in Table 1, or 
by changing the amino acid: sequence of iubiquit in 
near its carboxyl -terminus in such as way that the 
ubiquitin moiety is no longer, ..recognized by. the 
ubiquitin-specif ic processing protease but can still 
be recognised by the rest of . the* degradative path- 
way. These and other ways ^-to reduce ' the * rate of ' 
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deubiquitination of a ubiquitin-protein fusion can 
be readily -ascertained : by : those 'skilled ' in the; art. 
The methods of this invention can be employed, 
inter alia , for regulating the" half -life of a 
protein 'intracellular iyv v ' * "There ' are many instances 
where this capability is useful.* For example, when 
a gene is introduced into ' a cell' for expression 
therein, the ^expressed product can be designed for a 
long or short half-life. depending upon the particu- 
lar heed. ^ - : 

In "general, destabilized proteins which have 
short -half -lives are more amenable to regulation of 
intracellular levels of the protein. The ability to 
finely regulate* the. intra cellular levels and ac- " 
tivity of a protein can be useful in therapy or in 
the work ; with* in vitro cell cultures. in gene 
therapy/ for' example/ a ^ene may be introduced" into 
a cell to compensate for a genetic deficiency or * 
abnormality. The" gene Can be inserted under control 
of an inducible promoter. Induction results in 
enhanced expression bf the 7 gene product and con- 
sequently', higher levels of the product within the 
cell. If the genei is designed to encode an unstable 
protein, the - intracellular concentration of the 
expressed' proteih" Will be 'more quickly responsive to 
a later ! reductibn' r ini±e^rate of its synthesis 
because "it does not pfefsist within the cell. : In 
this 'wayV the^ intracellular level and/or activity of 
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the protein encoded Jby the. . inserted gene can be; moire 
finely regulated. ; ^ - - 

The method of this invention can also be used 
to expand the uses of selectable markers, by 
shortening the time necessary for. a -phenotype 
related to the marker, to . become manifest. Toward - 
this end, a product encoded by a , marker, gene can- be 
destabilized by altering, its amino-terminus ac- s - 
cording to the N-end rule. In this way selection " 
for the negative phenotype . can he facilitated- 
because the product of , the ^marker r gene, will be more 
quickly extinguished after function -of the gene -i 
encoding the marker is abolished. . An example -is. the 
thymidine kinase (tk) genet. : The, tk ; gene can be -; * v * : 
engineered to encode a less stable enzyme by, intro-. 
ducing an appropriate destabilizing amino acid at : 
the amino-term.inus . Gene mut-at ion, resulting in tk 
phenotype will be more quickly manifested; by cell^s . 
because residual tk . will : be more quickly degraded.^ 
This can be especially useful in slow. .growing cells 
where more time is required to "-dilute, out" tk 
synthesized prior to transforation to the tk" type.- 

The* principles of protein .modification based. >t 
upon the N-end rule may also be employed in, the . ..- , 
design of cytotoxics. Prqteinaceo.us cytofcqxins can . 
be designed as unstable proteins degradable by the 
N-end rule pathway so that they dp nqt persist after 
their toxic action has been exerted on a target 



BNSDOCID: <WO 8802406 A 2 I > 



WO 88/02406 PCT/US87/02522 



-20- 

cell. Reducing the lifetime of the toxin reduces 
the likelihood of killing noh'targeted ' cells." ' - 

Discovery 'of the N-end' rule" pathway '' of degra- 
dation allows development of mutant 'ceils having 
mutations in genes. encoding essential components of 
the N-end rule pathway; For example/" cells can be ' 
produced that either permanently "or' conditionally 
are unable to efficiently degrade otherwise short- 
lived" proteins. These ceils can' be 'used to produce' 
desired proteins that ordinarily would be unstable 
within a cell. '' 1 ; - ; 

The invention' is' illustrated further by the 
following "detailed description of the elucidation of 
the N-end "rule !' 1 " - ' ' "-"'- " ' 

Methods " ' ' '•'• : ' ' ' • - •' • : . " 

Protein Secruencincf '' .. « 

S. cerevigjae cells' carrying pUB23 (Fig." l) , 
which encodes "ub-Met-^gal (Fig. 3A) were' labeled ' 
with [ S] methionine, followed by extract pre- 
paration,' immunoprecipitation of ^gal" and" elect-' 
rophoresis" as described below. The wet polyacryla- 
mide gel was subjected to autoradiography, the band 
of ^gai was excised, and the electroeluted pgal was 
subjected to six cycles of ' radiochemical sequencing 
by Edinan degradation."' The sequencing was carried 
out by W. Lane at the MicroChem Facility of Harvard 
University. 1 - "'■ ' -•'' ■ 
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Site-directed Mutagenesis . . , 

pUB23 (figure. 1) was treated sequentially with 
Acc I,, the Kl enow fragment of . ppl. I,., and Bam HI- A 
fragment containing the Xhp v I site was. purified.; and 
inserted between a filled-in Hind . Ill site and a BAM 
HI site of the M13mp9 phage ._DNA. . (J. Messing, .and >X. 
Vieira, Gene 19, 263/(1982)).., Site-directed muta- . 
genesis (M. Smith, Annu , Rev . Genet . 19, 42.3 ..,(198.5) ) 
was carried out t as described by.. Kramer, W. et al . 
Nucl. Acids Res . 12 , 9441 (1984) using a . synthetic 
25-residue oligodeoxyribqnucleotide, .containing ten 
bases on the 5' sid^ and twelve bases on the 3' side 
of the Met codon of p gal . All four bases , were 
allowed to. occur at the original Met codon positions 
during synthesis. Primary phage plaques were . ■ . .. 
screened by hybridization (Wood, N.I., PNAS .. 

82 , 1585 (1985)) , with the use .of a li-residue 
oligonucleotide probe spanning the region , of .codon 
changes and hybridizing to. the original sequence. •„ 
Nonhybridizing plaques containing inserts of the. 
expected size were sequenced by the chain termina- t 
tion method. (Sanger, F. et al.. , PNAS 71 5463. 
(1977) ) . To transfer the desired constructs into 
the pUB23 background, replicative form DNA of mutant 
phages was digested with Xho I and Bam HI, and added 
to the same digest of the plasmid pLGSD5-ATG (see , 
Fig. 1 and L. Guarente, Methods Enzymol . , 101 . 181 
(1983)). The ligated mixture was used to transform 
the E . coli strain MC1061. (M.J. Casadaban and S.N. 
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Cohen, J. Mol. Biol . , 138 179 (1980)). Colonies 
containing- - plasmids -'of interest * (In which the open 
-reading frame of^gal : had been restored) were 
recognized by r their 'light blue color on' X-gal - 
plates.* 7 - ■ ' * ' • •* - : *• 

Pulse-Chase Experiments ' 

S. cerevisiae bells of the strain BWG-9a-l (KAT ' 
his4 ura3 ade6 ) ; transformed (FL" Sherman et al . 
Methods in Yeast Genetics Cold Spring Haxbor Labo- 
ratory, N.Y. 1981) ) with pi asmids of interest were* . 
grown at 3 0°C to A^ 0Q - f bf -approximately- 5 in a medium 
of 2 percent galactose, G'. 67 ! percent "Yeast Nitrogen 
Base without amino abids (DIFCO) , adenine (10 jag/ml) 
and amino acids including methionine (Sherman, F. et 
al. , " supra ) v Typically , : -cells- from a 5 ml culture - 
were harvested by filtration through the well bf ' a : 
Millipoire microtiter filtration plate, washed 
several times' on the filter with the" same medium 
lacking methionine 1 arid' resuspended in 0.3 ml of 1 
percent galactose, 5t> mM' potassium phosphate" buffer 
(pH 7.4). I * S : ]-ntethionine (50 to 100 AiCi) was thW 
added for 5 miniates at 30°C; the cells ¥eie col^- ; 
lected By filtration and resuspended on 0.4 ml of 
the growth medixim containing cycloheximide at 0.5 
mg/ml. Saiaples : (b . 1 ml) u were 1 ' withdrawn at indicated 
times, "and added to 6.75 J ml of cold buffer A (s£e 
below for buffer composition) containing leupeptiri^: 
pepstatin A, antipain, ' aprotinin and chymostatin ' 
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(Sigma) , (each at\ 2-Q^g/ml) in .addition: to r 0* 4 ml of 

glass beads. ;. Immediately thereafter,, the cells , were 

disrupted by -vortexing .for-- approximately 3 minutes * 

at 4°C; the extracts were centrifuged at 12,00.0.g for 

3 minutes and the radioactivity of acid-insoluble 
3 5 

S in the supernatants was determined. Aliquots. of 

the supernatants, ^containing equal amounts of the. 

3 5 - • •- 

total acid-insoluble . S^wem processed for immuno- 

precipitation with a monoclonal -antibody to : .gal.. 
Ascitic .fluid, containing a .molar, excess of the 
.antibody , (at. least .tenfold) -^was added to -each 
aliquot, , with subsequent ■incubation- at 4°C for 2 A ; 
hours; protein ...A-Sephar-ose (Pharmacia),-, was .then :. 
added, the suspension was incubated with rocking at, 
4°C, for 30 minuses, and centrifuged at 12, OOOg, for 1 
minute. The protein A-Seph^rose pellets were, washed 
three times in buffer A -(see: below) containing ; ; 0. 1 
percent sodium dodecyl sulfate (SDS) , r;esuspended, in 
an SDS, dithiotreitol , (DTT) ^containing electro- : 
phoretic s^mpjle buffer (U. K. ; Lae^li;,,* Nature 227 680 
(1970) ) , heated at lOO^C for 3. minutes-, and cen- } : - 
trifuged at 12,000g for l t minute. r . ; . Equal aliquots , of 
the supernatants- were subjected to electrophoresis:, 
in a 7 percent discontinuous . polyacrylamide-SDS gel 
(15 by 15 by 0.15^ cm) with, sub^eguent- f lpurography- : 
In some experiments,, the above protocol was not 
used, . but the extracts were ^prepared by boiling 
cells directly t in the presence of SDS, with es- 
sentially the same results. 
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Plasmid pUB23 : (Figs: : r and 3) ; wa's introduced' 
into DS410 # a- miniceil-proQUcing E: coli stfain. 
(N. Stoker, et al-. V in- Transcription and Trans- 
lation: A practical ' Approach B'i, D.'Harnes arid S. JV 
Higgins, : Eds.", " IRL' press, Oxford', ; 1984, p. 153) . 
Minicells'were prepared' and labeled for 60 minutes 
at 36°C with [ 35 S]methioriine' (• '600'ciymmole, 1 
Amersham)- as described by N. Stoker et al.; supra . 

Labeled minicells'were centr if uged; resuspended 
in 2 percent SDS , 10" ■ mM- DTT, ; 10 ; mM Na-HEPES (ph 7.5) 
and- heated at 100°C -for 3 'minutes.' After centri- 
fugation at 12', OOOg for 1 "minute the supernatant was 
diluted 20-fold with buffer A (1 percent Triton 
X-100, 0.15 M NaCl", ' '5 mM Na-EDTA, 50 mM Na'-HEPES, ph 
7.5) , followed by ; the addition of phenylmethyl- " ' 
sulfonyl flubride (PMSF) and N-ethylmaleimide to 0.5 
mM and lOmM, respectively. ' ftfter 4 hours : at 4°C, 
the sample was dialyzed- against buffer A containing 

...... ' 

0.5 mM PMSF ; overnight at'**4 C, and processed for' 
immunoprecipitation (as ' described above) . 

Analysis of ub- ^qal proteins produced in' yeast " 

S. cerevisiae ceils caifrying plasmids of interest 
were grown in 800 ml of a uracil-def icient medium, 
then harvested and -disrupted with glass beads in 
buffer Ik containing ieupeptin, pepstatin A', anti- 
pain, apfotinin and chymbstatin (each at 3 /tg/ml) . 
The extract Was ceritrifuged at 12, OOOg for 3 
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minutes . Saturated . aitfmqnium sulfate was „added to 
the supernatant to aff inal . concentration of. 57": 
percent.. ...After oversight incubation at 4°Cy .the " 
precipitated protein was^cpllected, by centrif uga.tion 
at 23,000g for 3 0 minutes.. The pellet was redis-. 
solved in buffer A cpntaining- pro£ease,:.inhibitors . ,.: 
After clarification at, 12,000g. for 3 minutes," the 
sample was passed through .an affinity column which 
had been prepared by . crosslinking ;an IgG fraction 
from an ascitic, fluid, (containing a monoclonal : 
antibody to. gal to Af f i-Gel ; 10. ( : Bip-Rad) . . ;The IgG -' 
fraction used for crosslinking ,had been, purified, 
from the ascitic, fluid by affinity chromatography o : n 
protein A-Sepharose. . After ; washing with buffer ,A 
lacking ,Tritpix. X-100.,. the .antibody-bound .proteins 
were eluted with 0.25 M glycine^-HCl OpH 2.6) . The; 
eluate was immediately -adjusted to , pH_ 7 . 5 with .1- ,M . 
Na-HEPES (pH 8.5), ; and thereafter, made 0,1 percent 
in SDS. The .sample was concentrated by, ultrafil- . «; 
tration in Gentricon 30 (Amicon) , and ; sub j ected tqv : 
electrophoresis in a 7 percent discontinuous poly- 
acrylamide-SDS gel (U-K v Laemmli, ; Nature (London) ^ \ 
221 , 680 (1970),),. Electroblotting. of proteins . to 
nitrocellulose, and irnmunoblot . analysis with, a - 
peptide-mediated antibody ,, to ubiquitin were per;-^ . 
formed as described by P.S. ; Swerdlow/ ,D. Finley r and 
A. Varshavsky, Analyt . Biocheiru 156,, 147 : (1986.) . 
The same Results . were obtained with a different 
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ahtibody. tp .ubiquifcin , obtained from A. Haas (Univ. of 
Milwaukee Med'.- School).. .. . 

Detaile d Description - of- the . Figures , 

Figure 1 shows . construction of a. ubiguitin- lacZ 
gene fusion. pUB2, a P BR3 2.2 -based genomic DNA< clone 
CE. Ozkaynak, et al. Nature 312 , . 663 (1984) contains 
six repeats of the yeast ubiquit in-coding sequence- 
(open boxes) together -with, the flanking regions ■ 
(jagged- lines) ; pOB2 was modified as shown in the 
diagram by .placing a. Bam HI Site six. bases down- 
stream from the : f irst lubiquitin orepeat .: . This ■„ 
allowed the: construction . of -an in-frame fusion 

(confirmed by .nucleotide sequencing) -between a 
single ubiquitin repeat and. the lacZ gene of the,, 
expression vector pLGSD5-ATG (called G2 in L. 
Guarente, Methods Enzym ol . , 10,1, , 181 (1983 ) ) The 
term »2 A m» denotes a region of the pLGSD-ATG that .... 
contains the replication -origin. and flanking se- 
quences, of . the yeast plasmid called 2 /i* circle- s (See 
L. ; Guarente,, supra) . .Figure 3 B' shows the amino acid 
sequence of , the fusion protein: in the vicinity of 
the ubiquitin-/?gal junction..; '• • : - . 

Figure 2 • shows that . the in vivo half-life of 
gal is a function of its ajnino-terminal residue, 
(lane a) Minicells isolated from .an -Ei coli strain 
carrying pUB23, the initial ub-lacZ fusion (Figs. 1 
and 3B) , were labeled,- with, [ 3 5 S ] methionine, for .60 :. 
minutes at ; 36° C< . with -subsequent- analysis of .^gal: as 
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describedv : The same result was' obtained when the 1 
labeled minicell SDS extract was- combined with an 
unlabeled yeast SDS extract before immunoprecipita- 
tion of gal* (lane, b) - : ,S , c er e vi s i a'e cells carrying 

pUB23 (Fig* '1) , ' which- encodes ^ub-Met-ggal (Fig. 3B) , 

3 5- • 
were, labeled with [ - S] methionine for 5 minutes at 

30°C> with 1 subsequent; 7 analysis;' of p gal. ^ The same 

result was obtained With : the lengths" of the 

[ SJmethionine labeling periods 'from 1 to-30 

minutes, and with yeast extracts produced either - by 

mechanical disruption ■ of cells c ih the- presence of 

protease inhibitors, or by boiling the cells -directly 

in an SDS-containing buff er. (lane c) Same as lane 

a but with - E. ■ coli . cells > carrying the control 

plasmid p'LGSDS - -(called ~ Gl ' in !>. Giiarente, supra .) 

which encodes gal. planes d to g) , S. : cerevisiae - 

cells carrying pUB23 (Fig. 1)-V wh'i'ch. : encodes 'ub-Met- 

^ gal : (Fig* 3A) ,. we ire labeled with"" : [ : ^S-] methionine 

for 5 minutes at 3 0 P C (iane d) followed by a chase^ 

in the presence of - cycloheximide -for 10/30, and 60 « 

minutes (lanes e toi g) } , extrabtion/jXinmunoprecipita- 

tion, and analysis of ^galL -(lanes • h to : j ) S&me'as 

lanes d to f, but with ub^Ile-^gal (see" Fig. 3A) . 

(lanes k; to *m) Same as* Tanes- h to ; jr but with 

ub-Gln-^gal . „ (lanes n-ta q) Same as lanes d to g, 

but with; ub-Leu-^gal. (lanes r to u) Same as lanes - * 

d to. g, . but with: ub-Arg^gal. Designations: ori; 

origin of the' separating gel; r \ib, ubiquitin; j3 gal , 

an -:electrophoretie" band of the pgal ■ protein * 
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containing a specified amino-terminal residue; in 
thisj. terminology > the Met-jsgal po'rtiorr of- ub-Met^ - 
P- gal is designated* as^galv " Arrowheads' dehote a 
metabolically;. stable; about^ 9ttkD degradation product 
of ^gal which .is formed apparently as ; the • resuit of 
an in vivo endoproteblytic cleavage of a proportion 
of short-lived ^gal proteins such as Leu-^gal and 
Arg-^gal (lanes n to u) . " 

Figure 3 shows the changing. amino acid residues 
of .gal at "the: ubiguitin- gal junction. (A)" The 
initial plasmid, pUB23 - (Fig.'i), which encodes ■ - 
ub-Met-^gai, was mutageriized-: as -described -above to 
convert the .original Met codon ATG "at the ub-£gal 
junction into 'codons specifying .19 amino acids' other 
than MetV-( The -original round of mutagenesis shown 
in Figure 3> produced '15 out of 19 possible sub- • 
stituti-ons. : The remaining four substitutions wers : , 
produced later (see Table 1)') The arrowhead in- ' 
dicates the site of the deubiguitinating ih vivo 
cleavage- in the, -has cent fusion protein that occurs " 
with all of. the -fusion proteihs except ub-Pro^ gal 
(see text). All of the coristructions shown encode 
His :as: the: second^ "gal residue. In addition, in ■ 
some of ;the constructions (ub-Met-His-Gly-/?gal; 
ub-Met-Gln-Gly^gal, and ub-Met-Gln-His^Gly-^gai; 
the. last one produced by ah insertion mutation, see 
Table 2 )v either His -or Gin were following Met at 
the ubiquitih-^gai junction, with indistinguishable 
consequences- for the metabolic stabilities of the - 
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corresponding . v gal ; .-prpteins>:' , -(B) The amino: acid: 
sequence (in single -letter^ abbreviations) of ub-Met- 
^gal, the - initial .fusion , protein : (Fig* ; 1 ) ,. in the - 
vicinity of the,, ujb-^gal.. function. Single-letter 
aiino.acid abbreviations : A, /Ala ; C, . Cys ; -D, ; Asp; E r 
Glu; F , Phe; G, Gly ; H,, His;, I* He; K r Lys; L, Leu; 
M, Met; N, Asn; P, Pro; Q, Gln;R, Arg; S f Ser;. T, 
Thr ; V, Val ; W, ; Trp,; Y : , Tyr. , - 

: Figure 4 shows that .ubiquitin- .gai is short- 
lived if not deubiquitinated. : (lanes a to ,;g). S . ^ 
cerevisiae cells ,- carrying plasmids encoding ■ . 
ub-X- p gal fusion; proteins in,; which is the residue 
indicated at the t9P-,of -each lane r were labeled for 
5 minutes at 3 0°C with [ 35 S}methionine : ; followed by 
extraction, ;iinirtunoprecipitation : and analysis of: , 
p gal . Fluorographic exposures f or : these lanes were 
several, times longer than those f or similar patterns 
in Figure. 2 to reveal the multiple ubiquit inaction of 
short-lived f} gal proteins., . (lanes h, i) Pluoro- - . 
graphic overexposure of lanes n,j o in Fig. 2 to 
reveal., .the ."ladder", of multiply - ubiquitinated v.. 
Leu- f ^gal proteins in a pulse-chase experiment r - (zero 
and 10 minjxtes. chase, respectively) , (lane ;j ) Same . 
as lanes a tp g, but with- ub-Pro-^gal . '(lart^-k) 
Sam^ as . lane j, but with ub-Gln-/?gal . v -(lane 1) Same 
as lane j. (lanes m to p).S> cerevisiae qells ; 
carrying a plasmid encoding ub-Pro-y&gal -were .labeled 
for 5 .minutes at 3Q°C with, [ 3 °S ] methionine (lane m<) , 
followed by a chase in the presence of cycloheximide 
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upper-small arrow- to ; thW ^i'ght o'f-ianfr p denotes- •' : 
ub-Pro-^gaL, a smallr'proportibn of which" is" 'still 
• present., after.. 1 houi^ chase . r -The 5 Hotter- small' arrow 
indicates - an apparently "deubiquitinated Pro'-^gal ■. 
that slowly accumulates during chase "and is :, meta- 
bolically stable. .The dot to the -left of lane m ' 
denotes an- endogeneous yeast protein • that is pre- • 
cipitated in : some ' experiments by - 'the antibody used . 
Square brackets '-denote -the multiply ubiquitinated 
p -gal species (see Fig.- >5) . other designation ' are 
as in Figure 2.- '■ r.i^.:; 

Figure 5 shows the : "ladder" i; gal- species ' • 
containing •■ubiquitih. - (lane a) S .'■ cerevisiae 1 cells 
carrying' a plasmid which encodes ub-Gln-^gal, were 
grown and disrupted, "and the extracts processed for 
■isolation of ^gal proteins by "affinity chromato- ■ 
graphy: oh" 'a ^'column with' -immobilized antibody to 
f> • The ' ^ gal 1 proteins thus obtained were 
electrbphoresed in a- polyacrylamide : -SDS gel/ trans- 
ferred' to nit'rb-cellulose, and probed ; witti : an ; 
antibody to ubi'quitin. (lane b) Same as lane 'a, but 
with ub-Pro-^g/a-1.' (lane c) Same as b but a longer ' 
autoradiographic exposure: (lane' d) si cerevisiae 
cells carrying a plasmid which encodes ub-Leu-^gal 
were labeled with [ 3 5 Sj methionine for 5 minutes, : 
with'- subsequent extraction, immunoprecipitatibn and 
electrophoresis ' of ^gal (the same sample as in ' 
Figure 4, lane f) Square 'brackets denote the" 
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multiply labiqiiitinated Gln-^gal species detected 
with, antibody- to \ibiquitin. - r: The. arrow indicates the 
band, of ub-Pro-^3gal<,. - the initial :fusion protein - seen 
in lanes b. ; and *c. ; . The arrowheads: indicates the ■ ^ 
position,, of - the. band of ; deubiquitinated ^gal .. (de- " 
tectable by .either Coomassie staining or, metabolic 
labeling/ but not -with antibody, to ubiquitin). 
derived from, the ub^Gln-^ga.l fusion protein-. . 

Figure. 6 shows. -both prokaryptic . and eukaryotic 
long-lived intracellular proteins have ; stabilizing- 
amino acid residues at their amino-tf=rmini whereas 
secreted proteins exhibit a complementary bias • 

(A) 20.8 long-lived, .directly sequenced,, intra- 
cellular ( noncompartmental i z ed) -proteins with 
unblocked amino-termini r from both .profcaryot^es. (77 .. 
proteins;) and eukaryotes^ (13,1- proteins) were distri- ;•, 
buteqt into . .three groups according to the .nature of 
their amino-terminal residues, as defined by the 
N-end . rule . (Table 1) . All, of the long-lived intra- 
cellular proteins examined bear exclusively stabi- 
lizing residues at their amino-termLni ; In panels vB 
to D /t analogous diagrams are presented for. 243 j < 
secreted eukaryotic proteins (B) , for 37 light and., 
heavy immunoglobulin chains (C) / -and for. 94 secreted 
eukaryotic toxins (D) . r . -Eritries -in C and D are ; 
subsets of entries in B. For proteins in- B to ,D, 
the amino-termini compiled correspond , ; whenever the 
assignment is possible,, to the .most processed form, 
of a protein that is still located within a 
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secreting cell* The data in A to D were manually 
compiled from- tlie entire" set J -6*f complete protein ." 
sequences - available bef ore 1981'. The same con- 
clusions have been, recently reached after a ; more ~ 
detailed " and extensive', ' coirlputer-assisted tabulation 
of protein . amino-termini .using the current National 
Biomedical Research- Foundation database"; The - 
amino-terminal residue's' of Asn; Cys, His : , and Trp 
were -excluded from the" compilation" because - in vivo' 
half-lives of the corresponding gal proteins- are 
still unknown (see, however, the legend to Table 1) . 
Inclusion of this* residues* (Table 1 1) into a recently 
compilation of the same 1 type did not bhange the 
original conclusion . ' Although- the amino-terminal 
Pro was also excluded from the compilation, Pro 
appears to be 'a stabilizing residue f or p gal (Table 
1) , consistent with the frequent presence of Pro : at 
the ^inb-termifii 1 of long-lived honcompartmentalized 
proteins: - ' - : 

2 - - Results and Discussion - 

Rapid ih vivo r d'eubiquitiriation r; of a nascent ubi- 
quitin- gal fusion ' protein 

Branched ubiquitin conjugates in wliich the - 
carboxyl-terminal glycine, of ubiquitin itioieties is 
joined via an isopeptide bond to the ^-amiho groups 
of internal lysine residues in proteins apparently 
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comprise the bulk-rpf ubiquiti^a conjugates in eu- . 
karyotic cells.. ry Joining pf: r ubiquitin to-, the.- amino- : 
terminal • or-aminQ: groups.. pf target- proteins , : to yield 
linear ubiquit in ; .q ; cnjugates /f . may also be chemically, 
feasible.: See, A.-. Hershko^, et al, Y , PNAS USA 81 : 7021 
(19 84) • ... Whether or not linear ,ubiquitin-protein : 
fusions are actually synthesized -in . vivo through; . 
posttranslational en2ymatic conjugation of ubiquitin 
to protein .amino— termini > - .such - proteins can also be ■> 
produced by constructing appropriate chimeric gen^s 
and expressing them . in yiyo. ; : Construction of one . . 
such gene, which encodes yeasty ubiquitin linked to 
gal of. Escherichia coli ,, is -shown in Figure-,!. . . 

When this-gene. is pxpress^d in E. coll ,- the , 
resulting < p gal-containing .protein has ,an apparent . 
molecular mass which is approximately 6 .kD greater c;. 
that that of the control p gal at value consistent 
with the presence of ubiquitin in the protein 
encoded by the chimeric gene. In contrast, when the 
same gene is expressed in yeast, the corresponding 
^gal protein is electrophoretically indistinguish- 
able from the control ^gal. This result is in- 
dependent of the length ,pf the . [ 35 S] methionine 
labeling period (between ! and,. 30 minutes) .- , 
Furthermore, determination of the* amino-terminal ; 
residue in the putative- Met-^gal . (half-life, : tjy 2 20 
hours) by Edman degradation of , the in vivo-labeled, 
gel -purified B -gal . (Figure 2, lane d) directly 



confirmed the presence of the expected Met residue 
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(Figure 3A and Table .1) at its aminctenninus. . - 
independent ^^t^r^^ cleavage off the 
fusion protein occurs immediately after the last tSly 
rescue b f ubiquitin is presented below. , We con- 
clude that in yeast, ubiguitin . is., efficiently ■ 
cleaved off the „asce ; nt ubiguitin- fusion protein, 
Yielding a deubiguitinated ^gal... - [The absence of 
the deubiguitination reaction in E, coli is con- 
sistent with other" lines of evidence, indicating that 
prokaryotes lack both ubiguitin and ubiguitin- 
specific enzymes]. 

The ubiguitin-> -gal junction encoded by the 
chimeric gene, Gly-Met (Figures 1 an d 3B) , is 
identical to the junctions between adjacent repeats 
xn the polyubiguitin precursor .protein, which is 
efficiently processed into mature ubiguitin. Thus 
xt is likely that the. same protease, as yet un- 

characteriz^d biochemically, is responsible both for 
the conversion of polyubiguitin into mature ubi- 
quitm and for the deubiguitination of the nascent 
ubig^itin-^gal. : pro Vi n t; If so, one potential .way 
to inhibit the i^n vivp deubigu itination . of ^ U J_ 
guitm-pgal (and thereby to allow analysis of . 
metabolic consequenc^ of a^ stable .ubiguitin attach- 
ment to/3-gal) would be to f convert the Met residue 

into^W ^ ^ ' UbigU .f t ^ al j«nction (Figure 3B) 
into other ammo acid, residues (Figure. ;3A) . The ■ 
unexpected results of such an approach • are • described 
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The in vivo half-Tife of ^qal is a function of its 
■ amino-terminal residue . ' The J ATG codon which, speci- 
. fies the br-igihal-'Mbt at the ubi- 

quitia junction (Figure 3B) was converted by sited 
directed^" mutagenesis 'into codons specif ying .^19 other 
amino acids '(See Figure 3A and Table 1). These, 
constructions ^differ exclusively " in the first codon 
of £gal at . the ubiquitin- junction (Figure 3A) . 
After each of the -16 plasmids thus designed was 
introduced into yeast, analysis of the corresponding 
gal proteins pulse-labeled in vivo led to the 
following results (Figures ' 2; 4 ,■ and Table 1): ... 

1) With one £xc£pti"6n (see below) , the effi- 
cient deubiquitinatlon of thd nascent ubiquitin-^gal 
occurs irrespective of the riature of the amino acid 
residue of ^ gal at the ubiquitin-^gal junction. 
Thus,- the: 'apparently ubiquit ih-specif ic protease f . 
that cleaves the original ubiquitin-yjgal prptein at 
the Gly-Met junction is generally insensitive to th.^ 
nature^ of the first residue of ^gai at the junction. 
(Figure :.3A and Table -l) . This result, in effect, 
makes it possible to expose different amino acid 
residues at the amino-termini of the otherwise 
identical y3 gal proteins produced in vivo. 

2) The in vivo half-lives of the^3 gal proteins 
thus designed vary from more than 2 0 hours to less, 
than 3 minutes, depending- on the nature of the amino 
acid residue exposed 1 at the amino-terminus of y3gal r 
(Figures 2, 4, and Table 1). Specifically, 
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deubiquitinated ySgal proteins with either Met, Ser, 
Ala, Thr, Val, f Gys or;:Gly Vjat the amino-terminus have 
relatively long in vivo half-lives of 20 hours or 
more (figure 2, lanes d to g, and Table : 1) , similar 
to the half-life of a control /?gal whose gene had 
not been fused to that of ubiquitiiri. ' in striking 
contrast, the p gal proteins with either Arg, Lys, 
Phe, Leu, : Asp or Trp at the amino-terminus have very 
short half -lives, between approximately 2 minutes 
for Arg- p gal and approximately 3 minutes for Lys- 
/3gal, Phe- /3 gal, Leu- p gal. Asp- p gal, Asn-^g gal 
and Trp- ^ gal (Figu're 2, lanes n to u, and Table 1). 
The half-life of ySgal proteins' with amino-terminal 
residues of either Gin, His or Tyr is approximately 
10 minutes (Figure 2 , lanes k tb : m, and Table 1), "* 
while an amino-terminal He or Glu confers on £ ; gal a 
half-life of approximately 30 minutes (Figure 2, 
lanes h to j , and Table 1) . Both pulse-chase and 
continuous labeling techniques were used in these 
experiments and yielded similar results. 

The set of individual amino acids can be 
ordered with respect to the half-lives that they 
confer on gal when exposed at its amino-terminus. 
The resulting rule (Table 1) is referred to as the 
"N-end rule". 
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, Table 1; The , N-end rule 



■ ... . - - . ••• ■•• .\t In vivo. . - ! ' - •■ 

Radius of- . deubi-quitination 

Residue X in gyration • -of- nascent ., ' 

ub-X-^gal . , ,of X(A) ; . ub-X-^gal t 1/2 of X-^gal 

Met 1.80 . .+. ■ : ■' 

Ser . .. ;.. 1. 08 . • .. . •+•• ■ , " , , ; ... •;. 

.■ Ala- '. ; ,0.77 ... , . ., +... r , /. ••. . 

Thr. .... ' ..1.24. ,•,+ . 20 hours 

Val ... . 1-29 .. a - •-+-_ ...... ■• : ■ .-■ . .-; • • 

Gl.y ., ... , P. ■ ,. . + . • • • ' .• • - . ' 

Cys ... : . *' .. •; + • 

lie ... . ..' .„ , .-. 1.56 ... : .. . - .. r ,>. 

Glu . , . ,1.77 - : . +-. .., , ... 30 minutes 

Tyr :.,2.13. . , , ... • . + • - ;• •. 

Gin.. 1.75 . .. .. + ... .10, minutes 

HiS ; . • . . •.. .v ; ; • ' ' ■'- 

Phe 1-90 + . I 

Leu 1.54 + ■ 3 minutes 

Trp + 

Asp 1-43 + 
Asn + 

Lys 2.08 + 
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Arg 



2.38 



2 minutes 



Pro 



1 .-'25 : 



7 . minutes 



*The rate of in - vivo deub'x^itination ;6f ub-Pro-*gal 

is "extremely low. The^ t -j -shown is, that of ;the * 
initial ub-Pro- figal" J= "~ 
lanes j to p) . 



1/1 

fusion protein (see fig, 4, 
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Legend to Table 1 _ , 

k The N-end njle. In vivo , half -lives of pgal 
proteins in the yeast s. cerevisiae were determined 
either by the pulse-chase technique (for short-lived 
gal's; see below), or. by measuring the. enzymatic 
actiy±.ty of . :.gajL .i^h crudd extracts. ...For the/jneasure- 
ment£ of ~0ga l] activity ,. cells growing in-, a galactose- 
containing medium were transferred "to an otherwise 
identical medium lacking galactose and containing 10 
percent glucose. After further growth for at least 
5 hours at 30 C, the ratio of ^gal activities per 
cell before and after shift to glucose was deter- 
mined for each of the ^gal proteins. [ GAL, promoter- 
driven expression of the fusion genes (Figs. 1 and 
3) is repressed in glucose medium] . For shorter- 
lived gal proteins . < 1 hour), the pulse- 
chase technique was usecKas well (Figs. 2 and 4) . 
E^gctrophoretic bands of p gal proteins labeled with 
[ S] methionine in pulse-chase experiments were cut 
out from scintillant-impregnatedf-dried gels similar 
to those of Figs. 2 and 4, and S in the bands was 
determined. The in vivo decay of short-lived gal 
proteins deviated from first-order kinetics in that 
the rate of degradation was lower when measured at 
later (1 hour) time points of the chase, the lower 
rate reflecting either a time-dependent toxic effect 
of cycloheximide or intrinsic characteristics of the 
in vivo degradation process. [Arrest of translation 
is required for an efficient short-term chase in S. 
cerevisiae because of the amino acid pool equili- 
bration problems related to the presence of vacuoles 
in this organism] . The half-life values listed 
below were determined for the first 10 minutes of 
chase. Several lines of evidence (see description 
of Figs. 4 and 6) suggest that Pro is a stabilizing 
residue. The listed radii of gyration of amino 
acids are from. M. Levitt, J. Mol. Biol. 104:59 
(1976) . 
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Amino-t erminal location of . an . amino "acid is ess- ' 
sential for its effect on , . gal halfrlife - 

Site-directed mutagenesis, was. employed to. 
insert a codon specifying; a "stabilizing" amino '.acid 
(in. this 'experiment',' the Met residue) - before the 
first codon of ^gal at the ubi quit in- p gal junction 
(Table 2). insertion of a stabilizing, residue (Met) 
before either another stabilizing residue (Thr) or a 
variety of destabilizing residues (Gin, Lys, and 
Arg) at the ubiquitin-^gai junction invariably . 
results in a long-lived deubiquitinated yBgal (Table 
2) . ; Furthermore, in contrast to ubiquitin-Pro-^gal 
which is not only short-liyed but also resistant to 
deub'iquitination (Figure 4, lanes j "to p/'and Table 
1), 'ubiquitih-Met~Pro-^gai; is ef f iciently devibi- ': 
quitlnated in vivo to yield a long-lived Met-Pro- 
/sgal (Table 2) - These results show' that both the 
identity of amino 'acid residue and its amino-' 1 
terminal location (presumably the presence of a free 
a-amino group)- are essential for its effect on ^gal 
half life. In addition, these results (Table 2) 
further support- the expectation that ubiquitih 
cleavage off - the 'fusion protein occurs immediately 
after: the- last Gly residue of ubiquitin (Figure '3 A).. 
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Table 2: : N^tenninal locat is 


essential- for its effect oh- tfgal hal f -Ti f e 


: Fus ion pro te i h 


t, of deub.iqui tinated . 

v.. • . y ■ , ./ ■ ' " 

tfusipn protein 


ub - Jhr - r £ga1 ■ 
ub - Met - Jhr. - 'flgal *. . 


■ > 20 hours 


:>20 hours 


y 

ub - Gin t £gal 
ub I Met - Gin - 0gal ,. 


... • ,. T =10 minutes 


. . , ; ; :1 >20- hours . - 


: y - ■ 
ub - Lys - tfgal 

ub - Met - Lys -.£gal 


. .'.-3 minutes 
_ . >20 hours . „ • 


■ - •"■ r . ' .... ... 

ub - Arg, - £gal 
ub - fclet r Arg, - jSgal 


= 2 minutes ; :•• 


: i >20 hours » 


^ ub T Pro - tfgal 
lib - Met - Pro - £ga-1 


* 

: ■ ~7\ minutes 
, , ;>20. hours 



Amino-terminal location of an amino acid is essen- 
tial for its effect on £gal half-life. The inser- 
tion mutants were obtained essentially as : described- : 
for the initial set of mutants except that a 32- 
residue oligonucleotide, 5 1 -CCCGGGATCCGTGC (G/C/ 
T/ ) (G/T) CATACCACCTCTTAG was used, containing 14 
bases on the 5 1 side and 15 bases on the 3 f side of 
the ambiguous codon inserted behind the Met codon. 
Bases in parentheses denote ambiguities at the 
positions 16 and 17 in the sequence. Half-lives of 
the corresponding ^gal proteins were determined as 
described in the legend to Table 1. 
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A long-lived cleavage product, of gal is formed 
during- decay of short-lived : ffgaj~' proteins . 

The electrophoretic patterns pf short-lived 
(but not of long-lived) ^gal proteins invariably 
contain a specific, about 90 kp ' cleavage product of. 
p gal (Figure 2, lanes n to u)' which, unlike the " 
parental /3gal species, accumulates during the 
postlabeling (chase) period ' (Figure . 4 , lanes m-p) . 
The 90' KDySgai fragment constitutes a relatively 
small proportion of the initial amount of the 
pulse-labeled ^gal. . Nonetheless, its .'existence 
implies "that an in. vivo endoproteoiytic cleavage can 
rescue a' protein fragment "from the metabolic fate, of 
its short-lived parental protein.' " it remains to be 
seen whether /the resulting possibility of multiple, 
half-lives, within a single protein "species is 

exploited in the design of naturally short-lived _ 
proteins. " " 

Ubiquitin-ggai is short-lived when not deubigui- 
tinated. ■■ ■■■ ■■ - • 

'^i^itlft-Pro-^gai;' the "only ubiquitin r /?gall 
fusion that is not deubiquitinated in vivo" (Fig. 4, 
lanes j to p) , has a half-life of approximately 7 
minutes (Table!) which is' less than. 1 percent of " 
the half-life ' of metabolicaily ' stable ^ gal proteins 
(Table 1) . One interpretation of this result is 
that a metabolicaily stable ubiquitin attachment to 
protein amino-termini is sufficient to signal 
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degradatibn of acceptor pro This inter- 

pretation is consistent" with' earlier biochemical and' 
genetic evidence" that ubiqxiitinatiori of short-lived, 
proteins in a' mammalian 9^11 is essential for their 
degradation. 'At the same time, all ubiguitin-^gal 
fusion proteins " other than ubiquitin-Pro-^gal are 
rapidly deubiquitinated in vivo (Table 1) . Thus., 
the posttranslational amino-termal ubiquitination of, 
proteins may not be involved in an initial recogni- 
tion or commitment step that designates proteins for 
degration in vivo. Whether posttranslational 
amino-t'erminal ubiquitination " (if it actually occurs 
in vivo) is essential for later stages of the 
degradation pathway remains' to be determined. 
Earlier in vitro ^experiments indicated' that prefer- 
ential chemical modification of amino-termini of 
proteolytic substrates inhibits their "degradation in 
an in vitro ubiquitin-dependent proteolytic system. 
Based on these data, it was proposed that amino-teir- 
minal ubiquitination of proteins is essential for 
their degradation. An alternative interpretation of 
the same results is that chemical blocking of : 
proteins ' amino-termini prevents the recognition of 
their amino-termihal residues by the "N-end rule" 
pathway whose initial stages are not necessarily 
ubiquitiri-dependent . * 
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(Fig. 2) reveal that 'the major band' of .'a deubi- ' 
quitinated, short-lived^ gal protein .coexists with a 
"ladder" of larger molecular mass, ^gal-containing 
bands irregularly spaced at 4 to 7 kD intervals. 
(Fig; 4, lanes c to - gj . No" such larger species 
appear when the fluorbgrams of long-lived ^gal 
proteins-are similarly overexposed (Fig. 4, lanes a 
and b) . Immunological analysis with both antibodies 
to^gal and antibodies to ubiquitin demonstrates. . 
that the "ladder" ^gai species cbritain ubiquitin 
(Fig. 5). ; ' " 1 ' • : "• 

A model fo r 'the selective degradation pathway . 

With; the exception of natural or engineered 
ubiquitin fusion proteins (Fig. 1, and Table 1) 
nascent proteins apparently lack ubiquitin moieties.- 
The in /vivo' amino-terminal processing of' nascent,, 
noncompartmehtalized proteins generates their mature, 
aminb-termirii via the action of amino- terminal 
peptidases whose substrate 1 specificities have been . 
partially characterized. (See 'Tsunasawa, S. et al.. 
J. Biol:' Chem. 5382 (1985) ; Boissel, J.P, et.al. 
PNAS USA 82 , 8448 (1985) j. We suggest that the 
amino-termini thus 'generated are recognized by an 
"N-end-reading» enzyme." One .'specific model is that 
a commitment to degrade' a protein molecule , is made 
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as a result of the" recognition of its amino-terminal 
residue by a stochastically , operating, enzyme -whose 
probability of ,f clainping H at the target !s. amino- ■ 
terminus is determined by the N- ; end rule. (Table 1), 

"Once the commitment is made/, it is, followed by ; a 
highly processive ubiquitination^ of the target 
protein which in the r case of f} gal is conjugated to 

more than 15 ubiquitin , moieties per molecule of . gal 
(Fig. 4, lanes c to g, and Fig. .5) . The multiply .. 

ubiquitinated target protein is then degraded by a . 

"down stream" enzyme (1) for which the _ubiquitin - 

moieties of the target 'sqrve as ^either .recognition 

signals or denaturation (unfolding) devices, or . - 

both. 

The ubiguitin-containing "ladder" ^gal. species 
(Fig. 4/ lanes c to f, and Fig. 5) consist of. 
apparently ' branched ubiquitin moeities joined .to ,.the 
€ -amino groups "of internal lysine residues., in p gal . 
Surprisingly, the "ladder" j? gal species' . derived from 
ubiguitin-Pro-^gal are electrophoretically ^ indistin- . 
guishablie from the analogous species of ^gal . whose . 
amino-termirial ubiquitin is cleaved off the nascent 
fusion protein (Fig. 4, lanes j to 1, and Fig. ; 5). 
If the electrophoretically indistinguishable ubiqui- 
tinated y3 gal species are indeed structurally homo- 
logous, these results would be compatible with two ~ 
alternative models in which,, immediately after the 
first ubiquitins are branch-conjugated to yjg.al, 
either a branch -ubiquitinated ubiquitin-Pro-y0gal 
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undergoes amino-terminal deubiquitination or, 
alternatively, and analogous '^gal r '^pecies lacking 
the amino-termirial ubicjuitin moiety reacquires it. - 
Experimental resolution of this ambiguity may : 
establish whether the posttrarislational amino- 
terminal ubiquitination of proteins (if it occurs in 
vivo) plays a . role in the seleictive protein turn- 
over*. '•■ - : - ■ ;-r j ■ ' : • . 

Although both proka'ryotic and eukaryotic 
proteins appear to ; Mlow the N-end ' rule (see 
below) , bacteria- apparently' lack* the*" ubiquitin 
system. 'Thus it is possible that the hypothetical 
N-end-recognizirig protein 1 is more strongly conserved 
between prokaryotes : and* eukarydtek than is the 'rest 
of the selective -degradation pathway. Interest- 
ingly ,."the properties " of a mammalian protein E3 
whose presence is required for ubiquitination of ' 
proteolytic substrate^ by ubiquitin-conjugatihg 
enzymes in vitro are consistent with it 'being a 
component of the N^^nd-recogriizing protein. ' 

The- N-end ruler a nd the : known amino-tendini of 
intracellular proteins . 

The unblocked- amino-terminal residues in 
metabolically stable , lioncompartmentalized proteins 
from both. ptokaryotes ^and eukaryotes are exclusively 
(Fig. 6A) of the stabilizing class (Met/ Ser, Ala, 
Gly f Thr; Val) , that is' the class that' confers long 
in vivo half KLives ' on ^kr ' (Table' 1) . The one 
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short-lived intracellular protein for which- the . 
mature amino-terminus is ; .Kno^iJ..is the. ell-, protein, of 
phage laiabda, the .central component „qf a trigger 
that determines whether t A grows lytically or . 
lysogenizes an infected cell.. . :(.y.S.., Ho, D. Wulff, 
M. Rosenberg, in Regulation .of Gene .Expression , I. 
Booth and C. Higgins, Eds. (Cambridge Univ. Press, 
London, 1986) , p. 79 .F.. Banuett , M. A . Hoyt r L. 
McFarlane, H. : Echols,. I . Herskov-itz., J . Mol. .Biol . 
187 , 213 (1986.) ; M. A. Hoyt, D.M- Knight, A. Das, . 
H.I. Miller, H. Echols, Cell . 31 , 565. : (1982) ; K. 
Nasmyth, Nature (london) 320 ,. ,670 (1983) ). The 
half-life of ell in lambda-infected, E. ,coli is less 
than 3 minutes. Strikingly the mature amino- > . 
terminus of" ell starts with Arg, (Ho, Y..W... et al,, J : . 
Biol . Chem. 257 , 9128 (1982)),. the most ..destabliz ing. 
residue in the N-end rule (Table 1) ..... 

While the destabilizing, amino acids can be. 
either hydrophobic, uncharged hydrpphilic or, charg- 
ed, they share the property of having larger radii 
of gyration than any .of the stabilizing amino acids 
except Met (Table 1). ( . „ . r 

Amino-terminal residues in compartmentalized pro- . , ( 
teins are largely of the destabilizing class . 

Figure 6 illustrates a. striking difference 
between the choice of amino-terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins (A) and in compartmentalized proteins, such 
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as secreted proteins (B) ,- many of which are also 
long-r 1 ived i-iv their respective ' extrace'l lula r com- 
partments . one implication of -^ttiis •finding is' that 
• a single intracellular 'degfad'ation pathway operating 
according to the N-end rule "could iae responsible 
both for -the diversity of in "vivo half -lives of 
intracellular proteins' and : for the selective de- 
struction of- compartmentalized proteins that are 
aberrantly introduced into the intracellular space. 
Some- miscbmpartmentalized proteins may be more 
harmful to the" cell" than others. It is therefore of 
interest that : secreted ; eukaryotic toxins contain 
strongly 1 destabilizing residues ; (Arg, Lys, Leu, Phe, 
Asp") 1 at their amind-te'rmini "more often than the 
general population of secreted "proteins (Fig. 6, 
panels B. to D) . ~ """" : 

The above consideration also suggest that, \ if 
the topological outside of a 'cell,, such as lumens of 
the endoplasmic' reticulum and Goigi, and the extra- 
cellular space, were to have degradation pathways 
analogous to the N-end rule pathway, they could be 
based on "inverted" versions of the N-end rule in 
which the--amin6-terminal residues that are de- "'. 
stabilizing inside the cell are now the stabilizing 
ones arid vice versa: Thus, the methods of the " 
present invention should ' also' be useful for manipu- 
lating the ' metabolic stability and other properties 
of compartmentalized proteins, including secreted 
ones. 
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Possible role of the N-end rule pathway in the 
turnover of long- lived proteins ,,. 

Long-lived intracellular proteins ; with. destabilizing 
(Table 1) penultimate residues .generally retain^ 
their initial amino-terminal methionine residue. 
The amirio-terminal residues^ in. long-lived intracel- 
lular proteins, that do undergo^ amino-terminal .. . 
processing are invariably of ; this, stabilizing class : 
(Table 1)\ An interesting, possibility . that would 
involve the N-end rule pathway in. the. turnover of 
long-lived proteins is that the ^rate-limiting step: 
in the in* vivo degradation of long-lived proteins 
may be a slow aminopeptidase cleavage that . exposes a 
destabilizing" residue,, followed by , rapid degradation 
via the 1 N-end rule pathway. Note that . fine-tuning 
of the rate of degradation may in this ; ,case be a 
function ~of thd rate of aminopeptidase cleavage 
exposing a destabilizing' residue rather. than a, 
function of the residue's destabilizing ^capacity . . 
according to the N-end rule. 

The N-end" rule and selective degradation of short- 
lived and damaged proteins . 

The recognition of polypeptide chain folding . 
patterns' or of .local chemical features .that .target . 
an otherwise long-lived but damaged protein for 
selective degradation in vivo is unlikely to be . ( , 
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mediated directly by the N-end rule pathway. 
Instead, we suggest that' specific proteases (ana- 
logous in function to nucleates that recognize/ 
specific lesions* in' DWiy cleave a targeted protein 
so as to expose a destabilizihg residue "at the 
amino -terminus of one of the two products of a cut. 
One testable prediction of this model is that the 
initial cleavage - products' of the degradation pathway 
should -bear destabilizing residiies at their N-ter- 
mini. The' preferential exposure of destabilizing 
residues at the amino-termini of products of the 
initial protein cleavage^ may be due either to 
intrinsic specificities of the proteases involved or 
simply to' the- v 'fa: ct that a majority of 'the amino 
acids belong to' the destabilizing class (Table i) . 
Furthermore,, initial cleavages of a protein would be 
expected to destabilize aspects of its original 
conformation, thus increasing the probability of 
further ihteirrial cut's. Whether the initial cleavage 
products of a protein 1 would be degraded exclusively 
via the N-ehd ^rute pathway or would have to be 
processed further by additional internal cleavages 
should depend ' bn several factors, such as the 
exposure of -destabilizing residues at the amino- 
termini of initial cleavage products, and the 
relative rates of ^ introduction of internal cuts. In 
this model, the N-end rule pathway should be es- 
sential for degradation of most of the metabolically 
unstable proteins, from chemically damaged, 
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prematurely terminated, improperly .folded and 
miscompiartinentaiizeci ones, to, those that cannot 
assemble into native multisubunit aggregates , and 
finally t to otherwise normal proteins that are 
short-lived in vivo., .Thus, the metabolic in- 
stability, of a protein may be. mediated not only by 
the exposure of a destabilizing residue at its 
amino-terminus, ,but . al?o by local : conformational and 
chemical features . of its polypeptide chain, >that 
result in proteolytic cleavages exposing destabiliz- 
ing residues at amino-termini . of cleavage f 
products. 

For any given protein, a variety of factors in 
addition to. the N-end rule may combine, to modulate- 
its half— life in vivo. Among such factors may be , 
the flexibility and accessibility of the protein's, 
amino-terminus (Thornton, J-M. and Sibanda, B.L. , . J.. 
Mol. Bio . 167 443 (1983)) , the presence r .o.f„ chemical- 
ly blocking amino-terminal groups, such as the acetyl 
group, the distribution of ubiguitinatable lysine 
residues near the amino-terminus, and other variabl- 
es, such as the structure of the carboxyr terminus . 
Since amino-terminal regions of multisubunit pro- 
teins are commonly involved in the interfaces 
between subunits (Thornton, *J.M. and Sibanda, B.L. , 
J. Mol. Bio . 167 443 (1983)), quarternary structure 
of proteins is yet another parameter that is ex- . . r 
pected to modulate the impact of the N-end rule 
pathway on protein half-lives in vivo. Finally as 
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suggested above, the N-end rule pathway may also be 
essential' for the degradation ; df 1 proteins whose " 
initial recognitibh as .'tWrgets^' f or degradation is 
independent of ' the structures : at' their amino-' 
termini . .; : ::<.■■ 

Functional sign ificance of ptasttranalational addi- 
tion of amino ' acids to amino^terraini of : proteihs . ; 
It has been known for many years that in both 
bacteria : and eukaryotes there exists ah unusual 
class of enzymes, aminoacyl-trahsf er RNA-protein 
transferases, which catalyze posttranslatibnal ' 
conjugation of specific amino acids to the "mature 
amino-termini of acceptor proteins in vitro (R.L. 
Soffer, in Transfer RNA: Biological Aspects . D. Soil, 
J.N. Abelson, p.r. s6himmel, Eds. (Cold Spiring" 
Harbor Laboratory, Cold Spring Harbor, NY 1980), 
p4 93;C. Deutch, Methods Enzvmol . 106, 198(1984): A. 
Kaji, H. Kaji, '<£.&. Novell i, J. Biol. Chem . 240. 
1185 (1965) ) . - The posttranslationai addition of 
amino acids to' proteins" in vivo dramatically ac- 
celerates in a stressed' or regenerating tissue , for 
example, after physical injury to axons of nerve " 
cells (S. Shyne-Athwal, R.v. Riccio, G. Chakraborty, 
N.A. Ingolia, Science 2*1, 603 (1986) ; N. A. Ingolia 
— — : ' J \ Neurosci 3 , 24 63 (19 83)). The N-end rule 
provides an explanation for this phenomen6n. We 
suggest that selective" changes in metabolic sta- 
bility of otherwise "undamaged, longlived proteins 



BNSDOCID: <WO 8802406A2 I > 



WO 88/02406 



PCT/US87/02522 



-53- * 

that may., be -reguired. t by ja. chajiged physiological : 
state of the cell are. brought about by post- 
translational addition, pf destabilizing amino acids 
to the amino-termini of target proteins in vivo* 
Strikingly, the known reactions of posttranslational 
addition ;'p,f amino, acids, to proteins , (R* L. Soffer, in 
Transfer RNA: Biological Aspects , Soil, CT.N. 
Abe Is on, P.R." Schimmei, Eds . (Cold Spring Harbor 
Laboratory, Cold. Spring Harbor, -.NY .198 0) , p493;C. : 
Deutch, . Methods Enzymol . f 1Q6 , , 3.9 8 (198 4) : A. Kaj.i, ,H. 
Kaji, . G. D. Novell i, J.. Biol... Chen . 240, 1185 (1965) ; 
S. Shyne-Athwal,. R.V. Riccio, G. Chakrabprty, N.A. 
Inqolia, . Science 231 , 603 (198 6) ; N..A. Ingolia et , 
al. , j. Neurosci 3 ,„ 2463 /( 1983 )), involve largely : -. 
those' amino acids (Arg, Lys, Leu, Phe, and Tyr) that 
are destabilizing according to the N-end rule. (Table _ u 
1) . physiological states in, which addition of 
destabilizing amino acids .to proteins could be 
expected to, occur include entry to and exit from the 
cell cycle, responses to chemical or physical 
stress, 'and specif ic differentiation events, .such .as 
erythroid differentiation and ^spermatogenesis, in 
which a proportion of preexisting,, .otherwise long- 
lived intracellular proteins, is selectively de- 
graded.. .. .. V;; t 

The in vitro degradation of some ' proteolytic. . 
substrates in a ubiquitin-dependent systeiri from . ; 
mammalian reticulocytes has recently been shown to, 
depend on the presence of certain aminoacyl-tRNAs 
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(Ferb6r ' S ' and Crechancver, A. , j. Biol. chem. 261 
3128 (1986)). We suggest- that this phenomenon als7 
reflects a requirement for posttranslational ad- 
dition of specific- destabilizing amino acids to the 
ammo-termini, of proteolytic- substrates . The 
initial proteolytic substrates in ■ question have 
amxno-terminal. residues of Asp br Glu,' both of which 
are destabilizing. according . to 'the" N-end rule 
(Table . 1). This raises an interesting and testable 

possibility that certain amino-terminai residues in 
. proteins may not be directly destabilizing as such 

but only /through their ability to be conjugated to 

other destabilizing -residues . •-• 

Equivalents ? . . 

% Those skilled in the art will recognize, or be 
able to. ascertain using no more 'than' routine ' experi- 
mentation; many equivalents to the specific: embodi- 
ments of the invention described herein. such 1 
equivalents are 'intended to be encompassed by the 
following claims: ■ - - 
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• * ; - j .CLAIMS, ■.. ■ , " r • ' ; " ■■ ' < 

1. A methpd- of. regulating the: stability of an 

intracellular prpteiri,: comprising : modifying the 
. amino-terminus; of -. a/ protein, to provide for 
exposure at the amino-terminus « of a desired 
amino acid of : the stabilizing or < destabilizing 
class according tothe N-ehd rule of protein 
degradation. . . . 

2. A method of destabilizing an; intracellular - 
protein, comprising modifying- the protein to 
provide an amino-terminal amino acid of the de- 
stabilizing class. 

3. .A method of Claim ..2 , therein an amino acid of. 

the destabilizing class is selected ; from f . the - 
group consisting of Isoleucine, .Glutamic acid, 
Tyrosine., Glutamine, Phenylalanine^ Leucine, 
Asparginine, Lysine or Arginine.. 

4 . A method of producing a protein which is more 
or less resistant to degradation than its 
wild-type counterpart, comprising introducing 
into the amino-terminus of the wild-type 
protein a desired amino acid of the stabilizing 
or destabilizing class according to the N-end 
rule of protein degradation to produce a 
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taining a masking protein joined to its amino- 
terminus, the masking protein being cleavable 
at the junction with the amino-terminus of the 
modified protein. 

6. A method of Claim 5, wherein the masking 
protein is ubiquitin. . • 

7. A method of Claim 6, wherein ubiquit in-specific 
protease is used to cleave ubiquitin from the 
fusion protein at the ubiquitin-protein junc- 
tion either in vivo or in vitro . 

8. A method of Claim 4, wherein the masking 
protein is joined to the amino-terminus of the 
modified protein, through, a protein sequence 

" comprising a site for recognition by an endo- 
protease. , 

9. A method of Claim 8 , wherein, the. endoprotease 
is complement factor X fi . 

10. A method of producing a metabolically unstable 
protein comprising producing the protein as a 
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f us ion protein wherein ubiquitin is joined to 
the amiho-terminus of the protein in a manner 
such that the fusion protein cannot be ef- 
ficiently deubiquitinated. 

11. A method" of Claim' 10, wherein the fusion 
protein 1 comprises ubiquitin-proline-prqtein. 

12. A method of Claim 10, wherein the fusion 
protein contains ubiquitin having a modifica- 
tion in the amino acid sequence recognized by 
ubiquitin— specif ic processing protease so as to 
reduce the efficiency of deubiquitination of 
the fusion protein by the protease. 

13. A gene construct encoding .a fusion 'protein, the 
gene construct comprising a DNA sequence 
encoding a masking protein joined to a DNA 
sequence encoding a protein of interest having 
a predetermined amino acid of the stabilizing 
or destabilizing class at its amino- terminus, 
the masking prbtein being cleavable at the 
junction with the amino-terminus of the protein 
of interest such that cleavage results in 
exposure of the amino-terminus of the protein 
of interest. 

14. ; A method of Claim 13 , wherein the masking 
protein is ubiquitin. 



BNSDOCID: <WO 6802406A2 I > 



WO 88/02406 PCT/US87/02522 



-58- 



15- 



A method of producing a protein having a 
predetermined amino-terminal structure, com- 
prising expressing the protein as a fusion 
protein wherein the* amino-terminus of the 
protein is fused to a masking protein which is 
specifically cleavable in vivo . or t in vitro at 
the junction with the amino-terminal amino acid 
of the protein. 

16. A method of Claim 15, wherein the masking 
protein is cleavable by a endoprotease. 

17. A method of Claim 16 r " wherein the masking 
protein is ubiquitin and the cleaving endo- 
protease is ubiquit in-specific protease. 

18. ' a method of 'producing a protein having .a, . . 

predetermined amino-terminal structure, com- 
prising: 

preparing a DNA construct encoding . a 
fusion protein comprising: 

V. st ^ ctural gene encoding the 
protein, the gene having a sequence 
at its 5 f end encoding the pre- 
determined amino-terminal structure 
of the protein; and 
ii) DNA encoding a masking protein linked 
to the S'^end of the structural gene 
encoding the protein,, the masking 



a- 
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protein being specif ically cleavable 
" in vivo or in "vitro at its junction 
with th£ aminb-terminus of the 
protein; and 
b. expressing 'the DNA construct in a host 

cell to produce the fusion protein encoded 
by the DNA construct. 

19 . A method of Claim 18 , wherein the fusion 
protein is cleaved within the host cell to 
release the protein having the predetermined . 
amino-terminal structure . 

20. A method of Claim 18, wherein the fusion 
protein is produced in a cell which lacks. the 
capability to cleave the masking protein. and 
the fusion protein is cleaved in vitro to.* 
release the protein having the predetermined 
amino-terminal structure. 

21. A method of Claim 18, wherein the masking 
protein is ubiquitin. : 

22. *A DNA construct for producing a protein having 

a predetermined aminb-terminal structure, 
comprising: 

a. a structural gene ' encoding the protein, 
: the" gene having a sequence at its 5 1 end 
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enuuaing tne predetermined ajnino-.tei-iuxiiai 
structure' of the protein; and 
b. DNA encoding a masking" protein linked to 
the . 5 ' ; end of the structural gene encoding 
• the" protein, the masking protein being 
specifically cleayable in vivo or in vitro 
at its .junction with the amino-terminus of 
the protein. 

23. A method of Claim 22/ wherein the masking 
\ protein is ubiquitin. • 

24. Isolated ubiquitin-specific protease capable of 
cleaving ubiquitin from the amino-terminus of a : 
protein' to which it is . joined /at the junction 
between ubiquitin and the amino-terminus. *.,of the 

! protein. 

25. - An, isolated DNA sequence encoding ubiquitin7 , 
' specific protease, of Claim 24. 

2:6. A mutant cell line which is conditionally or 
unconditionally? incapable of synthesizing one 
or moire proteases involved in the N-end' .rule 
degradative .pathway; . 
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FIGURE 1 
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FIGURE 6a 
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FIGURE 6b 
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